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1.  Introduction. 

The  central  limit  theorem  is  often  used  heuristically  to  justify  the 
approximation  of  histograms  for  data  by  the  normal  curve.  This  argument 
can  be  made  precise  through  the  following  model.  There  is  some  basic 
random  variable  X.  Take  the  sum  of  n  independent  copies  of  X,  and 
then  take  k  independent  copies  of  these  sums.  Provided  X  is  well- 
behaved,  n  is  large,  and  k  is  large  in  relation  to  n,  the  histogram 
for  the  sums  will  be  close  to  the  normal  curve.  In  this  way,  the  model 
rigorously  justifies  the  use  of  the  normal  curve  to  approximate  the  data. 

In  more  detail,  let  X^,X2, ...  be  independent,  identically  distributed 
random  variables.  Suppose  the  X^  are  integer -valued  and  have  g.c.d.  1: 

(1.1)  g.c.d.  (m:  P(Xi  =  m)>0)  =1. 

Suppose  too 

(1.2)  e|xJ|  <  oo  . 

Let  Sq  =  X1  +  •  •  •  +  Xn.  Take  k  independent  copies  of  Sn,  and  let  N^ 
be  the  number  of  these  sums  vfcich  are  equal  to  j:  the  notation  hides  the 
dependence  of  Nj  on  n  and  k.  Up  to  scaling,  the  counts  Nj  corres¬ 
pond  to  the  empirical  histogram  for  the  k  sums.  Of  course. 

Research  partially  supported  by  NSF  Grant  MCS-77-01665. 


1 


(1.3) 


E(NJ)  =  kPj,  where  Pj  =  P(Sn=j)  . 

Up  to  scaling,  the  numbers  p..  correspond  to  the  probability  histogram  for 

J 

the  k  sums.  When  n  is  large,  the  local  central  limit  theorem  implies 

that  the  p.  are  uniformly  close  to  an  appropriately  scaled  normal  curve. 

J 

In  this  paper,  we  study  the  random  variable 

(1.4)  M  .  =  max. (N.-kp. )  . 

nk  J  J  J 

Up  to  scaling,  this  is  the  maximum  difference  between  the  empirical 
histogram  of  the  k  sums  and  the  probability  histogram  of  the  sum.  This 
random  variable  is  the  key  to  understanding  the  maximum  difference  between 
the  histogram  and  the  normal  curve,  as  will  be  shown  in  a  future  paper. 

We  suppose 


(1.5) 


n  -*  oo  and  k/Vn  (log  n)^  -*■  oo  . 


Assumption  (1.5)  is  discussed  at  the  end  of  this  section.  Let 


(1.6) 


4  =  E(X1)  and  cr  =  Var 


In  Freedman  (1977)  it  is  shown  that  if  (1.1-1. 6)  hold,  then 


(1.7) 


m4 


cr/2rth 


log  n  -»  1  in  probability. 


Our  main  object  in  this  paper  is  to  prove  the  following  result,  which 
sharpens  (1.7)* 


(1.8)  Theorem.  Assume  (1.1-1. 6).  With  probability  approaching  one, 
maXj  (Nj  -kpj )  is  taken  on  at  a  unique  index  L^.  Moreover,  and 

are  asymptotically  independent,  being  asymptotically  normal 

and  being  asymptotically  double -exponential.  To  be  more  precise, 

let 


M 


nk 


(1-9) 


exp(-^u2)du 


> 


(1.10)  z  (x)  =  -  •  ■/  log  n-2  log  log  n  +  x  . 

n  V  oV2«n 

Then  ,  the  probability  that 

L^-nu  <  ya  V2n/log  n  and  < 

converges  to 

1 

-  ~2X 

$(y)  exp[-a/2  e  ]  . 

As  usual,  exp(x)  =  ex. 

This  theorem  is  illustrated  in  table  1  and  figure  1,  which  reports 
three  computer  simulations.  In  all  three,  the  basic  random  variable  X.^ 
took  the  six  values  1  through  6  with  equal  probability  l/6.  The  table 
shows  the  values  chosen  for  n  and  k,  the  lower  and  upper  quartiles  for 


TABLE  1. 

Computer  simulation  illustrating  theorem  (1.8). 


lower  quartile  upper  quartile  observed  value 


n 

k 

k/[-/n(log  np] 

Lnk 

M  . 
hk 

Lnk 

Mnk 

Lnk 

Mnk 

25 

100 

0.6 

83 

3.0 

92 

h.9 

srr 

2.5 

50 

500 

1.2 

169 

6.2 

181 

9-5 

185 

8.3 

100 

10,000 

10.2 

34i 

2k.  9 

358 

36.8 

358 

26.9 

FIGURE  1. 


Computer  simulation:  the  empirical  histogram  converges  to  the  normal  curve 


25  SUMMANDS#  100  REPETITIONS 


100  SUMMANDS.  10000  REPETITIONS 


the  asymptotic  distributions  of  and  M^,  as  computed  from  the 

theorem,  and  the  values  observed  in  the  simulation. 

In  the  first  line  of  table  1,  the  location  L  is  reported  as 

UK 

97:  so  N.-kp.  is  largest  when  j  =  97-  From  the  figure,  the  mode  of 
J  J 

the  empirical  histogram  is  at  82:  so  N.  is  largest  when  j  =  82.  In 

J 

general,  the  location  of  the  maximum  deviation  and  the  mode  are  very- 

different.  Compare  theorem  (1.8)  with  Diaconis  and  Freedman  (1978)* 

The  figure  shows  the  histograms  themselves.  The  horizontal  scale 

shows  j,  but  the  three  axes  are  arranged  so  that  the  means  nn  line 

up,  and  the  standard  deviations  oYn  cover  -the  same  physical  distance. 

The  vertical  scale  shows  (<r-/n)  x  (N./k)  X  100 %  that  is,  N.  is 

J  J 

converted  to  a  percent  (relative  to  k),  and  the  histograms  are  rescaled 
vertically  by  cry'n  to  have  the  same  physical  area.  For  a  discussion 
of  this  convention,  see  pp.  29  ff  and  275  ff  of  Freedman,  Pisani,  Purves 
(1978).  Thus,  the  maximum  difference  between  the  rescaled  empirical 
and  theoretical  histograms  is 


In  the  third  line  of  table  1,  for  instance,  that  is  about  5$» 

In  section  2,  we  argue  heuristically  that  our  problem  can  be  reduced 
to  finding  the  location  and  size  of  the  maximum  of  a  sequence  of 

independent,  normal  variables  having  mean  0,  but  variances  l-(j  / 2c  n) 
as  n  -*oo.  This  problem  is  solved  in  proposition  (2.2).  In  section  4, 
we  prove  theorem  (1.8).  Section  3  gives  some  technical  lemmas,  including 
bounds  for  the  probability  of  an  intersection  of  events,  and  approximations 
for  binomial  tail  probabilities. 
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In  this  paper  and  in  Freedman  (1977 )  the  condition  E|x^j  <  »  was 
assumed.  In  fact  lemmas  (4.1)  and  (4.3)  of  this. paper  replace  lemma  (15) 
in  Freedman  (1977),  and  hence  prove  theorem  (5)  of  that  paper  assuming  only 
E(X^)  <  oo . 


The  main  use  of  the  third-moment  assumption  in  this  paper  is  to  justify 
the  bound  on  the  error  in  the  local  central  limit  theorem  in  equation  (3.27). 
In  section  5 >  we  construct  an  example  to  show  that  new  limiting  behavior 
can  occur  when  E|x^|  =  oo .  The  argument  involves  Edgeworth -like 
corrections  to  sums  of  random  variables  without  third  moments, and  may  be 
of  independent  interest. 

Concerning  condition  (1.5),  if  k  » ■/n  log  n  but  k  =  0  (  /n  (log  n)^  ], 
the  form  of  the  norming  constants  in  equation  (1.10)  changes  radically, 
because  the  large  deviations  corrections  in  the  normal  approximation  to 
the  binomial  distribution  must  be  accounted  for.  This  is  worked  out  in 
detail  for  the  closely  related  problem  of  the  mode  of  the  histogram,  in 
section  4  of  Diaconis  and  Freedman  (1978).  If  k  =  0  (t/n  log  n),  then 
Freedman  (1977)  shows  that  the  maximum  deviation  does  not  converge  to  zero 
in  probability. 

2.  A  Heuristic  Argument. 

It  is  convenient  to  discuss  the  behavior  of  -kp^  separately  for 
four  zones.  The  zones  are  described  in  terms  of  constants  6  and  A, 
where  6  is  small  but  positive,  A  is  large  but  finite.  For  definite¬ 
ness,  we  will  assume  0  <  6  <  l/lO.  The  zones  are: 


7 


(2.1a) 


(2.1b) 


(2.1c) 


(2. Id) 


zone  I  :  |j-nn|  <  6ct  ./n 


zone  II  :  6cr-v/n  <  |j-nju|  <  AoVn 

zone  III:  AoVn  <  I J  -nj_i|  < 

3  /4 

zone  IV  :  <rn  <  |  j  -n^  |  . 


As  will  be  shown  below,  zones  II,  III,  and  IV  make  no  contribution  to  the 
maximum.  In  zone  I,  the  N.-kp.  are  approximately  independent  and  normal, 

tJ  t) 

with  mean  0  and 


Var (N.-kp  )  =  kp  (l-p  )  =  kp 

<J  J  cJ  J 


LLsuf] 

2  1  ’ 

2cr  n 


Thus,  L  ,  -nu  and  ycrV2rtn/k  M  ,  should  be  distributed  like  the  L  and  M 
nk  r  nk  n  n 

of  the  following  proposition. 


(2.2)  Proposition.  Let  U.  be  independent  normal  random  variables, 

J 

2  2 

with  mean  0  and  Var(U.)  =  1-j  /  2<r  n  ,  for  |  j  |  <  Bay'n  .  Let 

J 

M  =  oaXj  U. ,  and  let  L  be  the  index  at  which  the  maximum  is  achieved, 
n  j  j  n 

Then,  as  n  tends  to  <»,  the  probability  that 


Ln  <  yo-  y,2n/log  n  and  Mn  <  -/log  n  -  2  log  log  n  +  x 


converges  to 


1 

-  "  2  X 

9(y  )exp[  -cf/2  e  ]  . 
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As  in  (1.9),  $  is  the  standard,  normal  distribution  function. 

Proof.  For  -co  <  a  <  b  <  oo,  let  I  be  the  set  of  j  with 

—  —  —  at> 

I  j  1  <  bcr./n’  and 


auV2n/log  n  <  j  <  boV2n/log  n  . 

Thus,  ^'ab  is  always  finite.  Let  M&b  be  the  max  of  U^.  for  j  e  I 
Clearly,  if  a  <  b  <  c  <  d,  then  and  Mc(j  are  independent. 

Abbreviate 


(2.3)  wr(x)  =Vlog  n-2  log  log  n  +  x  and  y  =cr-/2 


We  will  show  that 


(2.4) 


P(Mab  <  wn(x)J 


1 
2X 


exp{-7[®(b)-$(a)Je  }  for  -oo<  a<b  < 


(2.5) 


lim  sup  P{M  <  w  (x)) 

-00a  u 
n  00  ^ 

"lx 

<  exp{-7  ®  (a)e  )  for  -co  <  a  <  0 


(2.6) 


lim  sup  P(Mh  ^  <  w 

n  00  00  ** 


(x)) 


1 

“  —X 


<  exp{-7[l-®(b)]e  2  )  for  0  <  b  < 


Granting  (2.4-6),  the  proposition  can  be  derived  by  an  elementary 
argument  as  follows.  First,  (2.4)  holds  even  for  infinite  a  or  b, 
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in  view  of  (2.5-6).  Now  let  and  be  independent  double- 

exponential  random  variables: 


P(Yi  <  x] 


1 

-pX 

exp[-7.  e  ] 


where 


y)  and  r2  =  7[l-4>(y)] 


9a 


SO 


Hie  function  vn(' )  defined  by  (2.3)  is  continuous  and  strictly  increasing; 
so  is  its  inverse,  vT1.  Equation  (2.4)  implies  that  the  joint  distribution 
of  Wn’L(M_<xy)  ^  w"1^*  )  converges  in  distribution  to  the  joint 

distribution  of  Y±  and  Y^.  Now  P(Y1  =  x)  =  P(Y1  =  Y  )  =  0.  So 

(2.7)  P{Ln  <  ya  -/2n/log  n  and  Mn  <  wR(x)) 


P(M  <  w(x)  and  M  >  M  ) 

-ooy  n  -coy  yco 


P(w“1(M  )  <  x  and  w_1(M  )  >  w-1(M 

n  -ooy  n  -ccy  n  v  yoo 


P^  <  x  and  Y±  >  Y^ 


1 

-  —  z 


1 

-T72 


f  1  2  ~  r  2  ~  1  r  2 

I  27le  exp[  ^  exp[-r2e 

J  -OO 


1 

7?  2 


-  ®(y)  r  I 

-00 


1  1 

'  2  Z  ~  2  Z 
7e  exp[-?e  ]dz  . 


)) 


]dz 


The  last  displayed  expression  equals 


1 

"2X 

4>(y)  exp[  -ye  ]  . 


Thus,  (2.4-6)  imply  (2.2). 
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We  turn  now  to  the  proof  of  (2.4).  We  will  write  v  for 

J 

2  2 

Var(U. )  =  1-j  /2cr  n.  Fix  an  x,  and  abbreviate  u  -  u  (x).  In 
J  n 

essence,  the  proof  of  (2.4)  is  the  following  computation: 


(2.8) 


log  P(Mab  <  w)  =  £  log  P(Ui  <  w) 


ab 


=  £  P{U,  >  w) 


ab 


=  I  A  /2n  .  i  •  exp(-  |  w2/v  J, 
by  (3.15).  Hie  symbol  =  means  "approximately  equal"  and  is  only  used  in 


heuristic  argument.  For  our  purposes 


(2.9) 


Vvj/w  =  l//log  n  . 


2  2 

In  the  exponent,  l/v.  =  1  +  j  /2cr n.  So 

d 


1  2  1  1X22 

(2.10)  ~  w  /v  =  log  n  -  log  log  n  +  ^x  +  j  /  (2cr  n/log  n) 

1 

_  "  J  x 

Thus,  log  PfM^  <  w)  is  approximately  y2  cr  e  times 


(2.11)  — -  -  ■ — -  £  exp{  -  |  j2/(2cr  n/log  n)) 

t^o^n/log  n  ^eIab 


Hiis  last  is  a  Riemann  sum  for 


— —  f  exp(-  5-  u2)  du  =  0(b)  -  0(a) 
V2n  Ja 


11 


We  now  indicate  the  details  required  to  justify  (2.8).  All  "0" 


and  ,,o”  error  terms  are  uniform  over  J  e  I  .  Note  that  a  and  b 

ah 

are  finite,  so  j  /n  =  0  (l/log  n).  In  (2.9),  then,  the  ratio  of 
the  two  sides  converges  to  one  as  n  00,  uniformly  over  j  e.  I  In 
(2.10), 


12  1  1  2 ,  j _ > 

-  w  —  -  g  w  (1  +  -V) 
j  2a  n 


Vj  2a  n 


=  0 (log  n)  •  o(j  /b) 


=  o(l)  . 

So  the  difference  between  the  two  sides  of  (2.10)  is 

2  .2 

I"2  ^2”  "|logn*^T  +  o(1) 

2a  n  ^  2a  n 

2 

=  ~  (w2-  log  n)  -t-  +  o(l) 

2a  n 

=  0(log  log  n)  •  0  (l/log  n)  +  o(l) 


=  o(l)  . 


1 

'  2  X 


times  the  j-th  term  in 


Thus,  the  ratio  of  P(Uj  >  w)  to  ^2  a  e 
(2.11)  goes  to  one  as  n  goes  to  unfinity,  uniformly  over  j  e I  As 


a  result. 


(2.12) 


P(Uj  >  w)  =  o(l) 


and 


(2.13) 


P(U.  >  w)  y^2  a  e 
3 


[®(b)  -  ®(a)]  . 


Now  (2.12-13)  imply  that 


£  log  P(U.  <  w)  =  £  [P(U.  >  w)  +o  (P(U.  >  w))] 

°  d£Iab  3 

also  converges  to  the  right  side  of  (2.13). 

This  completes  the  proof  of  (2.4).  Ihe  proof  of  (2.5)  is  very 
similar.  In  place  of  (2.9),  we  use  the  estimate 

•/vT/w  <  l/w  . 

To  modify  (2.10),  we  use  the  fact  that 

l/vj  >1  +  j2/2cr2n 
so 

12,  1  ,  ,  ,  J  1  1  2  .2  2 
2  w  /vj  >2  log  n  "  log  log  n  +  “x+~w  J  /2c  n  ; 

and  £  P(U.  >  Is  bounded  above,  to  within  o(l),  by 


13 


T^ir  e 


times 


log  n 


times 


1  W  £ 

&  -feF*  JeI 


cxp(-  |  w2j2/:tr2n)  . 


I 


_2 

Now  w  log  n  ->1,  and  the  last  display  is  at  most 

®(a)  +  o(l) 

because  the  normal  density  is  monotone  increasing  on  (-00,0).  The  o(l) 
corresponds  to  the  term  in  j  at  the  edge  of  I_ooa  •  This  completes 
our  discussion  of  (2. 5),  and  (2.6)  is  symmetric.  □ 


15a 


¥ 
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.  Probability  Approximations. 

The  random  variables  N.  introduced  in  section  1  have  a  joint 

J 

multinomial  distribution.  We  will  approximate  this  distribution  by 
getting  upper  and  lower  bounds  in  terms  of  independent  binomial  random 
variables.  Hie  binomial  variables  are  then  approximated  by  an  appro¬ 
priate  normal  distribution.  This  section  contains  the  basic  bounds 
and  approximations.  Hie  main  results  are  (3.2),  (5.16),  and  (3-17). 

Hie  first  lemma  is  an  unper  bound  for  multinomial  probabilities  due 
to  Mallows  (1968). 

(3.1)  Lemma.  Let  have  a  joint  multinomial  distribution. 

Then,  for  any  real  numbers  a^,a7, , . . .  ,aj  , 


P(M^  <  for  1  <  j  <  J)  <  TT  J  P(M.  <  a. )  . 


j=l 


J  -  J 


The  second  lemma  will  be  used  to  get  a  lower  bound  for  multinomial 
probabilities . 

(3.2)  Proposition.  Let  (a,3,P)  be  a  probability  triple,  and  let  c  3^ 

be  sub-o-fields  of  3.  Let  A,  and  G.  be  events  in  3..  Let  \* 

J  i  J  j’  j 

and  «  be  nonnegative  numbers.  Suppose  P(G. )  >  1-e.  and 
«  j  —  J 


°"  Vi 


c  • 


> 
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Then 


is  bounded  between 


TT 


j-i 


pj-i 

“  j=o 


and 


ttj 

11 3=i 


\*  + 


•j-i 


j  ^3=0  j 


Proof.  Induction  on  J.  □ 

Let  N  be  binomial  with  parameters  k  and  p,  where  0  <  p  <  1. 

The  next  set  of  results  give  approximations  for  the  distribution  of  N  which 
are  uniform  as  k  and  p  vary  over  a  wide  range.  Readers  may  be  surprised 
to  find  us  proving  versions  of  the  central  limit  theorem  for  coin-tossing, 
at  this  late  date.  However,  we  need  bounds  which  are  uniform  as  p  gets 
small}  to  stay  in  the  realm  of  the  central  limit  theorem,  we  will  require 
kp  to  be  large.  Our  results  involve  the  function 


(3.3)  g(x)  =  (ltx)  log(l+x)  -  x  for  -1  <  x  <  <*»  . 

Hie  function  g  is  strictly  convex,  strictly  decreasing  on  (-1,0), 
strictly  increasing  on  (0,oo).  it  satisfies 

(3.4)  g(x)  =  I"  x2  +  0(x^  )  as  x  ->0  . 

More  precisely, 

(3.5)  \  x2/l+x  <  g(x)  <  |x2  for  X  >  0  , 

with  the  inequalities  reversed  for  x  <  0.  The  function  g(x)  is  closely 
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f 


connected  to  a  function  which  arises  in  the  theory  of  large  deviations 
theory-  See,  for  example,  pp.  100-101  of  Kolchin,  Sevastyanov,  and 
Chistyakov  (1978). 

We  begin  with  a  special  case  of  theorem  (4b)  in  Freedman  (1973), 
restated  here  for  ease  of  reference. 

(3.6)  Lemma.  Let  u  >  0.  Then 

P{N  >  kp(l+u)l  <  exp{-g(u)kp)  . 

Combining  (3-5)  and  (3-6)  gives  a  version  of  Bernstein's  inequality: 

(3-7)  P(N  >  kp-nn)  <  exp{-|-m2/(kp+m))  for  m  >  0  . 

Parenthetically,  theorem  (4a)  in  Freedman  (1373)  implies 

(3.8)  P{N  <  kp-m)  <  exp{-i-m2/kp)  for  m  >  0  . 

Hie  next  result  is  a  variation  on  lemma  (7)  of  Freedman  (1977)/  and 
is  proved  the  same  way.  (The  condition  b  >  l/b  in  that  lemma  is  super¬ 
fluous.  ) 

(3.9)  Lanina.  Fix  e  >  0.  There  is  a  positive  B  such  that  the  ratio 
P(N  =  a+b)/P(N=a)  is  bounded  between  (1  +  c)  times 


1 
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for  all  Integers  a,  b  and  k  satisfying 


a  >  1/6,  a+b  >  l/6,  k  >  l/B,  a+b  <  6k,  |b|<By^a  . 

The  next  result  gives  a  bound  for  binomial  probabilities  P(N-v) 
in  terms  of  the  function  g.  Informally:  for  small  values  of  p, 
large  values  of  kp,  and  v'a  not  too  fax  from  kp,  P(N=v)  is 
asymptotic  to 


■  exp{-g[ (v-kp)/kp]kp)  . 

•j/2«kp 

This  is  a  combination  local  central  limit  —  large  deviation  result, 
holding  uniformly  in  small  p. 

(5.10)  Lemna.  Fix  e  >  0.  There  is  a  positive  8  that  P(N=v) 
Is  bounded  between 


(1 +  e)  — ^ —  exp(-g[ (v-kp)/kp]kp) 

■/  2«kp 

for  all  k,  p  and  v  satisfying 

p  <  B,  kp  >  l/&,  |v-kp|  <  Bkp,  I  v-kp|  <  B  -,/k  . 

Proof.  First,  suppose  v  is  the  integer  part  of  kp,  so  v  *  kp-6 
with  0  <  9  <  1.  Abbreviate  p  =  1-p,  so  k-v  =  kp+e.  By  Stirling's 
formula,  P(N=v)  is  asymptotic  to 


1 


times 


d-EP1 


-kp+e 


(i  +  -|) 

kP 


-kp-e 


The  first  expression  is  asymptotic  to  l//2«kp  .  The  second  is  asymptotic 
to 


-kp  -kp 

u  -  J)  a  *  4> 

Kp  kp 


E|y  taking  logs  and  expanding,  the  last  is  seen  to  be  nearly  1.  Thus, 


(3.11) 


P(N=[kp] ) 


More  explicitly,  the  ratio  of  the  two  sides  of  (3.11)  converges  to  1 
as  6-^0,  uniformly  in  k  and  p  satisfying  our  conditions,  namely, 
p  <  8  and  kp  >  l/s. 

We  now  apply  (3*9),  with  a  =  [kp]  and  b  =  v-a  to  see 


(3.12)  P(N=v )/P(N=[kp] )  «  exp[ -g(b/a)a]  . 

First,  as  easily  verified,  the  conditions  of  (3.9)  hold. 

1 


(3.13) 


Wh' 


w  1 


Second, 


indeed  a  =  kp  +  0(l)  and  b  =  v-kp  +  0(l)  is  small  relative  to  kp, 
by  assumption.  (The  "0(1)"  terms  are  bounded  as  5  ->0,  uniformly  in 
k,  p  and  v  satisfying  our  conditions.  )  Likewise, 
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(3.14)  -  1  i 

indeed,  this  expression  is 

(i  +  4)b/d  -  ’ 

kp  kp 

and  b  =  v-a  =  v-kp+O  (l )  is  small  relative  to  kp  and  to  kp.  This 
completes  the  proof  of  (3.12). 

Recall  that  a  =  [kp]  and  b  =  v-a.  The  final  step  in  proving 
(3.10)  is  to  show  that 

|g(l^£)kp.g(|)a| 

is  uniformly  small.  This  tedious  piece  of  calculus  is  omitted.  a 
We  now  recall  a  bound  on  the  tail  probabilities  of  a  normal 
random  variable: 

(3.15)  Suppose  U  is  a  normal  random  variable  with  mean  0  and 
variance  1.  Let  w  >  0.  Then  P{U  >  w}  is  bounded  between 

1  1  ,  1  2N  1  1  /  1  2n 
—  - T  p  w  )  and  — =  -  exp(-  p  w  )  . 

fen  w  +  i  /2n  w 

The  following  result  is  a  generalization,  as  one  sees  by  taking 
f(x)  =  |>  x2. 
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(3.16)  Proposition,  (a)  Let  f  be  convex, and  suppose  f(oo)  =  ».  Then 
J  exp[-f (u)]  du  <  fi-fij  exp[-f(x)]  . 

(b)  Suppose  l/f'  is  convex.  Then 

f 00  ! 

exp[-f(u)]  du  >  - exp[-f (x)]  . 

JX 

Proof.  Claim  (a).  Let  F  =  exp(-f).  Then  F(oo)  =  0  and  F*  =  -f'F. 
Now  f '  is  monotone  so 

/  00  /■  00 

f'  (x)  |  F(u)  du  <  I  -F'  (u)  du  =  F(x)  . 

Jx  Jx 

Claim  (b).  Let  F  =  (l/f'  )  exp(-f ).  Then  F(oo)  =  0  and 


F  =  -(l+£-^)  exp(-f).  So 

ft 


[1  + 


SlkLj  [" 

f<  (xf  Jx 


foo 

F(u)  du  >  I  -F'  (u)  du  =  F(x)  .  □ 

Jx 


Note.  If  f  is  convex  and  f'  is  concave,  then  l/f’  is  convex.  This 
is  the  case  for  our  function  g.  The  concept  of  complete  monoto- 
niclty  is  relevant  here:  see  section  XIII. t  of  Feller  (1971). 


The  next  lemma  extends  corollary  (8)  of  Freedman  (1977).  It  provides  a 
bound  for  binomial  tail  probabilities,  in  terms  of  g,  analogous  to  (3.15). 

(3.17)  Proposition.  Fix  €  >  0.  There  is  a  6  >  0  such  that 
P(N  >  kp-Hn)  is  bounded  between 
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exp[-g(m/kp)kp] 


(i  +  e)  J_  M 
~  V2Z  m 

for  all  k,  p  and  m  satisfying 

p  <  8,  kp  >  l/8,  Vkp/ 8  <  m  <  8kp,  m  <  8/ k  . 

Proof.  We  break  the  tail  probability  up  into  two  parts: 

P(N  >  kp-Hn}  =  P1+P2  , 

where 

P^  =  P{kp-nn  <  N  <  kp+2m)  and  P2  =  P(N  >  kp+2m)  . 

Later,  we  will  show  that  P^  is  negligible,  but  that 

(3.18)  P  »  — -  exp [ -g (m/kp )kp 3  . 

1  /2n  m 

First,  some  preliminaries.  To  simplify  the  notation,  let 
G(u)  =  exp[-g(u/kp)kp]  . 

Clearly, 

(3.19)  G(m)  is  small  relative  to  — —  ^2.  G(m)  , 

-/2nkp 

because  m  is  small  relative  to  kp,  by  assumption.  Likewise,  in  view 

of  (3.5) 
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(3.20)  G(2m)  is  small  relative  to  — -  G(m)  , 

7  2«-  m 

because  m  is  small  relative  to  kp,  but  large  relative  to  -/kp  . 
Next,  we  claim 

(3.21)  £°°  G(i)  is  bounded  above  by  (l+e.  )  ^2.  Q(m)  . 

i=m+l  1  m 


Indeed,  G  is  monotone  decreasing,  so 


i=m+l 


G(u)  du  < 


1 

g'  (m/kp) 


G(m) 


by  (3.16).  But 


gi  (_2L)  =  load  +  J2.)  »  J£ 

g  vkp'  ogvX  kp  '  kp 


because  m  is  small  relative  to  kp,  by  assumption.  This  proves 

(3.21) . 

We  are  now  ready  to  prove  part  of  (3.18),  by  estimating  P  from 
above.  Let  v  range  from  [kp]+m+l  to  [kp]+2m,  so  P]L  =  P(N=v). 

As  (3.10)  implies,  is  bounded  above  by 

(3.22)  (l+€?)  -----  F  G(v-kp)  . 

l/2rt kp  v 

Because  G  is  decreasing,  the  expression  (3.22)  is  bounded  above  by 
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1 


G(i)  . 


In  view  of  (3.19)  and  (3-21), 


(3.23)  is  bounded  above  by 

(1+0  — -  exp(  -g  (m/kp  )kp}  . 

3 


We  are  now  ready  to  prove  the  rest  of  (3.18),  by  estimating  Pg 
from  below.  Indeed,  as  (3.10)  implies  Pg  is  bounded  below  by  (1-e^ ) 
times 


(3. 24)  - —  r  G(v-kp)  >  -±-  l2m  G(i)  . 

y&rkp  v  f/2nkp  i=m+l 


The  right  side  of  (3- 2k)  is  T^-Tg-T^.,  where: 


Tx  =  —=  GU) 

Y2itkp  i=m 
r  00 

>  -  ■■  ■  I  G(u)  du 
-/2nkp  Jm 

9  ~  Q(m)  by  (3.16)  , 

V2rt 


=  - —  G(m)  =  o[^E  G(m)  ]  by  (3.19) 

2  i/SMqp  m 
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y-=  r  o(D 

■/2itkp  i=2m+l 

<  (1+S  )  ^  G(2m)  by  (3.21) 

x  ./2k 

=  0[I^G(m)]  by  (3.20), 

m 

on  recollecting  that  j/k p/m  is  small  by  assumption.  This  completes  the 
proof  of  (3-18). 

Our  last  job  is  to  show  that  P?  is  negligible  by  comparison  with 
P^.  But 

P2  =  P(N  >  kp+2m )  <  G(2m) 

by  (3-6);  now  (3.18)  and  (3.20)  can  be  used  to  complete  the  argument.  □ 

(3.25)  Remark.  Under  the  conditions  of  (3.17),  P(N  <  kp-m)  can  be 
bounded  by  the  same  expression.  The  argument  is  almost  identical, 
because  (3.10)  is  symmetric  around  kp.  The  only  difference  comes  at 
the  end: 

P2  =  P(W  <  kp-2m)  <  exp(-2m2/kp)  by  (3.8) 

<  G(2m)  by  (3-5)  . 


2k 


Many  of  our  arguments  use  the  local  central  limit  theorem.  We  state 
two  versions  for  ease  of  reference.  For  the  first  version,  assuming  only 


a  finite  second  moment,  see  page  517  of  Feller  (1571).  For  the  second 
version,  assuming  a  finite  third  moment,  see  page  197  of  ftetrov  (1575). 

(3.26)  Suppose  (l.l),  and  E(X^)  <  °°,  and  (1.6).  Then  uniformly  in  j, 

P(S  =j)  =  — - —  exp{-  |[  (j-nn)/a/n  f)  +o(l//n) 

(j 

(3.27)  Suppose  (1.1),  (1.2),  and  (1.6).  Then  uniformly  in  j 

P(S  =j)  »  — exp{-^[  (j-nn)/or/n]2)  +  0(l/n) 

n  <ry2*n  * 
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4.  Proof  of  theorem  (1.8). 


The  first  step  in  the  proof  is  to  show  that,  with  probability- 
approaching  one,  the  maximum  does  not  occur  in  zones  II,  III,  or  IV 
as  defined  in  (2.1).  This  part  of  the  argument  does  not  require 
third  moments  nor  the  full  force  of  (1.5)*  Thus,  for  lemmas  (4.1),  (4.3), 
and  (4.5),  we  assume  (l.l)-(l .6)  with  (1.2)  replaced  by  E(X^)  <  °o  and  (1.5) 
replaced  by  n  ■>»  and  k//n  log  n  ■>«>.  We  begin  with  zone  IV. 

(4.1)  Lemma.  Let  y  be  any  positive  number.  Let  be  the  ™»v--irrmm 

of  Nj-kp^.  over  zone  IV,  namely  the  set  of  j's  satisfying  |  j-npl  >oti5/4. 

Let  0nk  be  ^  P^ability  that  >  y(k  log  n)1//2/n1//1+.  Then 

6  ,  -*  0  as  n  -*  oo . 
nk 

Proof.  To  begin  with,  is  bounded  by  the  sum  over  j  in  zone  IV  of 

PtNj-kpj  >  y(k  log  n)l/2/nl/U)  . 

By  Chebychev' s  inequality,  the  last  displayed  probability  is  at  most 

-2  1/2 

y  n  p./log  n  . 

J 

Summing  over  j  in  zone  IV  shows 

0nk  -  n)  ’  P(|Sn-np|  >  cr?^)  . 

Using  Chebyshev' s  inequality  again, 
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P(|Sn-n(j|  >  ar?^)  <  n”1/2  . 


0nk  -  y"  /log  n  *  D 


We  turn  now  to  zone  III.  It  will  be  convenient  to  abbreviate 


(b.2) 


m  =  / — -  log  n 

V  cr/2nn 


This  m  appears  in  (1-7)  and  is  the  leading  term  in  (1.10)}  it  will  have  some 
connection  with  the  variable  m  in  section  3. 

(4.3)  Lemma.  Let  0  <  A  <  <».  Let  be  the  maximum  of  N^-kp^ 

over  j  in  zone  III,  namely  the  set  SA  of  j  with  fitan1^2 <  |  J -n^ij  <  cna^^. 
Fix  any  e  with  0  <  e  <  1.  Then  there  is  an  A  =  A(e)  so  large  that 


P{M^  >  (l-e)m)  0 


as  n  and  k  tend  to  infinity,  satisfying  the  growth  condition  log  n 

1  2 

Proof.  Let  6  satisfy  0  <  6  <  jj-(l-e)  .  Recall  that  =  P(Sn=j). 

In  view  of  the  local  central  limit  theorem  (3.26),  there  is  an  A  >  0 
and  a  finite  n^  such  that  p^  <  $) cr/2nn  for  all  j  in  provided 
n  >  n^.  From  inequality  (3.7), 


^ >  p(»d  >  kPpd-'W  <  expl-ifl-c)2  t g  pt-e).] 

w 
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2 

We  must  now  bound  m  /[kp.+(l-e)m]  from  below.  The  choice  of 

J 

A  forces  kp.  <ke/oV2nn.  The  growth  condition  on  n  and  k  forces 
J  — 

(l-e)m  <  k@/cry'2nn  eventually.  Then, 


2 

m 

kp.  +  (l-e)m 

J 


> 


2ke^V2nn 


log  n 
20 


and  the  right  side  of  (4.4)  is  smaller  than  l/n.  Finally, 

P{Ma  >  (l-e)m)  is  bounded  above  by  the  sum  over  jeSA  of 

P(N.  >  kp.  +  (l-e)m)  . 

0  J 

3/4  . 

This  sum  comprises  at  most  2cm  terms,  each  bounded  by  l/n.  This 
proves  (4.3 ).  O 

We  turn  now  to  zone  II. 

(4.5)  Lemma.  Fix  A  >  6  >  0.  Let  be  the  maximum  of  N^-kp^.  for 
j  in  zone  II,  namely  the  set  S2  of  j  with  BoVn  <  |j-n^|  <  Acr^n . 

Let  m  be  defined  by  (4.2).  Then,  for  e  >  0  sufficiently  small, 

Ptr^  >  (l-e  )m)  -»  0 

as  n  and  k  tend  to  infinity  satisfying  the  growth  condition  k//n  log  n  ->  ». 

Proof.  In  view  of  the  local  central  limit  theorem  (3.26),  there  is  a 
6^  >  0  and  a  finite  n^  such  that  for  n  >  n^. 
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(U.6)  <  (l-61  )/c>y2nn  for  all  j  satisfying  | J -n|a |  >  Bay'S  . 


p 

Choose  €  positive  with  (1-e)  /(l-61)  >  1.  Inequalities  (5.7)  and 
(4.6)  imply,  after  some  algebra: 


(4.7) 


P(Nj  >  kpj  +  (l-e)m} 


<  exoT -  —  B  i£S- 

exPL  2  p  1+6 


idiere  P  =  (l-e)  /(l-B^  >1  by  our  choice  of  6,  and 


e 


n 


l-e 


1  11  11 
cr2  (2*)^  n^(log  n)2/k2  ->0 


by  the  growth  condition  on  n  and  k. 


Choose  0’  with  1  <  P’  <  P.  For  large  n,  the  right  side  of  (4.7) 
-6 9  /? 

is  at  most  n  .  Sum  (4.7 )  over  j  in  Sg,  comprising  at  most 
2 Pa -fix  terms,  to  see  that 
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P{ft>  >  (l-e)m)  <  2Ao-</n  n""^  -*  0  .  □ 

We  now  proceed  to  zone  I.  Fix  5  with  0  <  6  <  l/lO.  For 
-oo<a<b<°o,  let  I  be  the  set  of  j  with  |  j-np|  <  &cr  >/n 

and 

(**■•8)  a£r->/2n/log  n  <  j-n^  <  bcr  «j/2n/log  n  . 

These  intervals  are  all  finite  and  nonrandom:  I  corresponds  to  all 

— 00  00 

of  zone  I.  Let 


W  be  the  max  of  N.-kp  for  j  in  I  . 
aD  0  1  ah 

To  state  the  main  result  concerning  zone  I,  recall  zn(x)  from 
(1.10).  As  in  (1.9)*  let  $  be  the  standard  normal  distribution 
function.  We  will  now  need  finite  third  moments  and  the  growth 
condition  (1.5). 

(**•9)  Proposition.  Assume  (l.l)-(1.6).  Let  -«<  a<b <  c  <  d< 00  .  Then 

P{WabSzn(30  Wcd  ^  zn(y>3 

converges  to  expC-c--*/^  Q),  where 


Q  =  [<S(b  )-<t(a)]e 


1 

-2X 


[<J>(d)-&(c)]e 


'2y 
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The  proof  of  (4.9)  is  a  bit  complicated.  Here  are  some  preliminary 


estimates . 


(4.10)  Lemma.  Hie  bounds  given  below  apply  uniformly  to  J  e  I  , 
1  ■  *00  00 

and  the  ”o”  errors  are  uniform  in  j  e  I  as  n  -*  oo. 

-oooo 

(a)  (j-n4)2/cx2n  <  62  <  l/lOO. 


(b )  p  is  bounded  between 
J 


U  -  |  (liS)  ♦  0(^)1 


cr  J 2«n 


cr  n 


where 


1  .2 

=  2  s 


(c) 


1/[1-|  (1+a^) 


2  J 
a  n 


is  bounded  between 


1  +  |  ^6 


) 


(J-nH)2 

2 

cr  n 


2 

where  b„  is  a  function  of  6  only  and  b_  =  0(6  )  as 
o  o 

6  -♦  0. 

Note.  The  notation  in  (b),  although  not  standard,  is  convenient  for  our 

f 

purposes  and  will  be  used  throughout.  To  spell  (b)  out,  there  is  a 
sequence  en  >  0  with  €r  log  n  0,  such  that  for  all  n,  and  all 
j  in  I  ,  the  probability  p.  is  bounded  above  by 

“00  00  J 


[1 


/2nn 


and  below  by 


■  y2*n 


[1  -  |(l+a6)  ]  (1- 


2 

cr  n 


€n} 


Proof.  Part  (a)  is  trivial,  and  then  (b)  follows  from  the  local  central 
limit  theorem  (3.2 6),  using  the  estimate 


t  ^  -x  .  ,  ,12 

1-x  <  e  <  1-x  +  2  x  • 


Here,  x  =  (j-nn)"/2cr2n  <  82,  so 


x-i  X2  =  (1  -i  x)x>  (1  -T-  B2)  x 


Part  (c)  follows  from  the  identity 


1 

1-x 


1 


X 


+ 


2 

x 

1-x 


O 


Ihe  probability  in  (4.9)  will  now  be  estimated  frcm  above  and  below. 
An  upper  bound  derive*  from  Mallows' inequality  given  in  0.1): 


(4.11)  The  probability  that  <  zn(x)  and  Wc ^  <  zn(y)  is  at  most 


inhere 


^j(x)  =  P(Nj  <kpj+zn(x))  . 
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The  rest  of  the  upper-bounding  is  very  similar  to  the  lower-bounding, 
so  details  are  omitted. 

The  lower  bound  will  be  derived  from  lemma  (3*2).  In  that  lemma 

take 


(**-12) 


*3  ’  ("j  £  1‘V2n(*))  tor  3  6  rab 


(N.  <  kp  +z  (y)}  for  j  e  I 
J  j  n  cd 


To  define  G.,  let 
3 


(4.13a) 


Kj  be  the  set  of  i  with  n(t-6cr/n  <  i  <  j  , 


(4.13b) 


=  L  P± 

J  ieK. 

3 


(4.13c) 


G-  =  (£  N.  >  kg.  -M)  , 
J  ieK.  1  J 
3 


where 


(4.13d) 


M  =  (k  log  n) 


Slnce  I-oq  oo  includes  at  most  2Bon  indices,  (4.10b)  shows 


(4.14)  g  <  0.8b  for  all  jel  ,  for  all  large  n 

J  -OC  OO 


Lemma  (3.2)  also  involves  cr-fields  3  ,  defined  as  follows: 
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(i*-.  15 )  3.  is  the  cr-field  spanned  by  the  N,  as  i  ranges  over  k.. 

J  1  0 

To  apply  lemma  (3.2),  we  must  estimate  P(G  ). 

(4.16)  Lemma.  Define  G_.  by  (4.13).  Then  the  sum  of  l-P(G^) 

0  J 

over  all  i  e  I  tends  to  0  as  n  and  k  tend  to  infinity  satisfying  (1.5). 

Proof.  Clearly,  )T  N.  is  binomial:  the  number  of  trials  is  k, 
ieKj  1 

and  the  success  probability  is  g..  From  (3.8), 

J 

1-P(G. )  <  exp[-  \  1^/kg  ]  . 

Replace  by  the  upper  bound  (4.l4):  eventually, 


l-P(Gj)  <  exp(-  |  M^/kB)  =  n"1/26  <  n-5 

1 

2 

for  sill  j  e  ^_oooo  *  But  only  comprises  0  (n  )  terns,  p 

We  are  now  ready  to  establish  the  basic  lower  bound. 

(4.17)  Lemma.  Define  g.  and  M  by  (4.13),  and  z  (x)  by  (1.10). 

J  ** 

Let  N*  be  binomial,  with  success  probability  p'.  =  p./ (l-g.  .  )  and 
J  J  J  J 

the  number  of  trials  k'.,  the  integer  part  of  k(l-g._.)+M.  Let 

J  J 

\j(x)  =  P(N(  <  kpj  +zn(x))  . 

Then  P{W  .  <  z  (x)  and  W  .  <  z„(y)}  is  bounded  below  by 
&o  n  cci  —  n 


n--  Mx)  •  Ved\j(y)  -l 


J€lab  J 


jel 


[l-P(Gj)]  . 


Proof.  Given  3..  ,  as  defined  in  (<4.15),  the  conditional  distribution 

J 

of  N.  is  still  binomial;  the  conditional  success  probability  is  p', 

3  J 

as  defined  above;  the  number  of  trials  T.  is  an  3.  -measurable  random 

J  J 

variable: 


T.  =  k  -  V  N.  . 

3  ieK.  .  i 

J-l 


On  G .  ,  however,  T.  <  k'. .  So 

,1-1  3-3 


P{N  <  kp  +zn(x)|3  i]  >\  (x)  on  G  : 


heuristically,  the  more  you  toss  the  coin,  the  more  heads  you  get. 


Lemma  (3.2)  completes  the  proof.  □ 


We  must  now  estimate  X..  Here  are  some  preliminaries. 


(t.l8)  Lemma.  Define  X...  as  in  (<4.17 ). 

J 


So, 


\  .  (x)  =*  PIN’.  <  k’.p*.  +  h) 
J  J  ~  J  J 

where 

h  =  z  (x)  +kp.  -  k’.p'.  . 
n  3  3  3 


Hie  bounds  and  error  terms  given  below  are  uniform  over  jel  ;  the 

-cOoo 

"0"  and  "o"  are  as  k  and  n  approach  infinity,  satisfying  condition 
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(1.5)  that  k/y'n  (log  n'p  ■*  <x 


(a) 


k  .p . 
J  J 


is  bounded  between 


— =  ii -i  dso 

cr  y2nn  cr  n 


][1  +o( 


log  n 


)] 


with  a  of  (^.lOb). 

6 

(b)  h  =  zn(x}[l+0(n‘lU)] 

(c  )  h"  =  o  (k'.p1.  )2 
J  J 

2 

(d)  h  /k'p'  is  bounded  between 

J 

2 

log  n  -  2  log  log  n  +  X  +  |  (l+2b&  )  -  +  o  (l ) 

a  n/log  n 

with  the  b&  of  (4.10c). 

(e)  Recall  the  definition  of  g(x)  at  (3.3). 

Then  g(h/k'.p'.  )k'.p'.  is  bounded  between 
J  J  JO 

O 

2  loe  n  ■  loe  lc,e  n  + 1  x  +  £  (i+_2b  )  —  +  o  (l)  . 

cr  n/log  n 


and  k1.  given  in  (4.17), 

J 


where  0  <  0  <  1.  By  (4.13d)  and  condition  (1.5),  (M-e  )/k  =  o(l/log  n); 
and  gj  <  &  eventually,  by  (4.14).  Ihen  (4.10b)  completes  the  argument 
for  part  (a). 


Proof.  Part  (a’>.  From  the  definitions  of  r>' 


(4.19) 


k'.p'.  =  kp  .  [1  +  M~e^ - _] 

J  J  J  k  (1-g  ._x  ) 


56 


).  Recall  h  as  defined  in  the  statement  of  the  lemma. 


Continuing  from  (4.19), 


15TT  V®  1-8,.! 


as  in  (a),  using  the  definition  (1.10)  of  z  (x). 

Part  (c).  This  is  immediate  from  (a),  (b),  and  condition  (1.5). 
This  is  the  first  time  that  the  full  force  of  (1.5)  has  been  used. 

p 

Part  (d).  Using  (a),  (b),  and  (4.10c),  h  /k'.p'.  is  bounded 
between 


(log  n  -  2  log  log  n  +  x)[l  +  |  (l+bg)  ^■~^h2-]  [i  +  0(_i__)]  . 

tr  n  K 


Of  course,  (2  log  log  n-x)/log  n  <  b&  eventually,  proving  (d). 

Part  (e).  This  follows  from  (c )  and  (d),  by  using  inequality  (3-4). 

The  proof  of  proposition  (4.9).  We  pick  up  the  argument  from  (4.17). 

The  problem  is  to  put  a  lower  bound  on  n  \.(x),  the  other  factor 

13  ab  3 

being  similar.  Defining  h  as  in  (4.l8), 


l-\.(x)  =  P(N'.  >kp.+z  (x)} 


=  p{N’.  >  k'.p'.  +  h)  . 
J  3  3 


Fix  an  e  >  0.  Then  proposition  (3.17)  implies  that  eventually,  for  all 
jel  ,  l-\.(x)  is  bounded  between 


1 


(4.20) 


(1  +  0  (k'p'/2jt)1//':  i  exp[  -g (h/k'.p'.  )k'  p'. ' 
—  J  J  JJJJ 


''Eventually"  means  for  all  k  and  n,  with  n  and  k/Vn  log  n 
sufficiently  large. 

We  pause  to  verify  the  conditions  of  the  lemma  being  appealed  to: 
p'.  =  p./(l-g.  0  is  small  by  (4.10b)  and  (4.l4)j  k'.p'.  is  large  by 


.2  /,  ' 


(4.l8d);  h^/k'.p'.  is  large  by  (4.l8d);  h/k'p'  is  small  by  (4.l8e)j 

1/2  ^  ^  d  d 
and  h/k  '  is  small  by  (4.l8b).  Thus,  the  bound  (4.20)  is  established. 

The  exponent  in  (4.20)  was  estimated  in  (4.l8e);  the  factor  k'.p'. 

in  (4.l8a);  and  the  factor  h  in  (4.l8b).  We  conclude  that  eventually, 

for  all  j  in  I  ,  l-\.(x)  is  bounded  between 

"00  00  J 


(l+e )  (1+c  )ct  e  X^1 2 

—  —  5 

times 

1  '  -  exp{-  r-  (l+2b  )  - ) 

**'2*  yPn/log  n  n/log  n 

2 

where  cg  is  a  function  of  6  only  and  c^  =  0(6  )  as  6  0. 

Arguing  as  in  proposition  (2.2),  we  get  a  lower  bound  for  the 
probability  in  (4.9),  of  the  form  (l-36)(l-d^)exp(-Q&),  where 


2  e"X^2 


t®(fBb)  -®(fBa)] 


with  dg  =  0(&L)  and 
in  j  was  critical. 


+  &V2  e'y'2  [ ® (fBd )  -$(f&c)] 

2 

f&  =  l+o(5  ).  It  is  at  this  point  that  the  uniformity 
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An  upper  bound  of  similar  form  can  be  obtained  by  essentially  the 
same  argument,  starting  from  (4.11)  instead  of  (14.17).  Then  the  proof 
can  be  completed  as  in  (2.2).  Q 

Our  last  main  job  in  proving  (1.8)  is  showing  that  max(N.-kp  )  is 

1  J 

assumed  at  a  unique  index  j,  with  probability  approaching  one  as  n  and 
k//n  (log  n 'p  tend  to  <»  .  The  following  heuristic  discussion  may  make 

the  argument  easier  to  follow.  Inspection  of  (1.8)  suggests  that  the 

l/2 

maximum  is  likely  to  occur  only  for  j’s  within  O(n/log  n)  '  of  npi. 
Call  these  the  critical  j's. 

(*4.21 )  There  are  O(n/log  n)^^  critical  j's. 


Recall  m  from  (4.2).  The  range  of  likely  values  for  the  maximum  is 
from 


m(l  -  '  lo.^loS,  n  _  ..  b 

log  n  log  n/ 


m(l 


2  loglog  n  _  b  ' 
2  log  n  ”2  log  n' 


to 


2  *°Slo&  n  +  -A—)1/2  4  m(i 

log  n  log  n'  x 


2  loglog  n  b  , 
2  log  n  2  log  n' 


Call  these  the  critical  values.  In  what  follows  we  write  a^  ~  t>n  if 

lim  a  /b  >0  and  lim  a/b  <  00 . 

-  n  n  n  n 

1  _1  _1 

p  r  "*  p 

(4.2.’)  There  are  the  order  of  m/log  n  ~  k  n  (log  n) 
critical  values  for  the  maximum. 


Each  of  these  critical  values  for  the  maximum  corresponds  to  some  value 

i  for  an  N.  of  around 
J 
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kp..m(l  -  ?  n)1^ 

log  n 


=  kp.+  (kp.)1/2  (log  n-2  loglog  n)x/'~  . 
J  0 


Now  N.-kp.  is  essentially  normal  with  mean  0  and  variance  kp.  «•  k/-/n, 
J  J  J 


2  j  (i-kp  ) 

P(N.=i)  = -  exp[--  -  7-  J — ] 


/2«kp . 


k-1^2  n^^  exp[-i  logn  +  loglog  n] 


-1/2  -1/1+ 

=  k  n  log  n  . 


Furthermore,  the  N.  are  nearly  independent.  Hie  chance  that  the 

J 

maximum  occurs  at  two  different  indices  is  bounded  by  the  sum,  over 
the  critical  j,  i  and  i'  satisfying 


(4.23) 


i-kp.  =  i'-kp  , 
cJ  J 


P(N.=i  and  N.t  =i'  )  ~  k-1n  2  (log  n)2 
J  J 


The  number  of  critical  i's  was  estimated  in  (4.22):  to  each,  there 
corresponds  at  most  one  i'  by  (4.23),  "at  most"  because  i»  has  to 


be  an  integer.  The  number  of  pairs  j,  j'  is  O(n/log  n)  by  (4.21). 

So  the  chance  that  the  maximum  occurs  at  two  different  indices  is  of  order 


n(log  n)-1  .  k1//2n"1//4 (log  n)"1^2  •  k^n-1/2  (log  n)2 


.  -1/2  1/4  /  ,1/2 

=  k  n  '  (log  n)  ' 


This  last  quantity  tends  to  zero  because  of  the  growth  condition  (1.5). 

Returning  to  rigorous  argument,  recall  zn(x)  defined  in  (1.10) 
and  Iafe  defined  by  (4.8).  The  main  estimate  is  the  following. 

(4.24)  Lemma.  Assume  (l.l)-(1.6).  Fix  positive  finite  numbers  a  and 

b.  Then,  uniformly  over  pairs  of  indices  j  /  j'  in  I  ,  and  pairs 

-aa 

of  integers  i,  i'  satisfying 

(4.2  5)  zn ( _b )  <  i-kpj  =  i'-kp^.,  <  zn(t) 

we  have 

(’i.26)  P (N .  =  i )  =  0( k’^n"1^  log  n) 

J 

and 

(4.27)  P(Nj,  =i'  |Nj=i)  =  0(k"1//2n"l/l4  log  n)  . 

Proof.  The  first  assertion  (4.26)  follows  from  lemma  (3.10),  the 

requisite  estimates  for  p.  being  given  by  (4.10).  To  make  this 

J 


hi 


valid,  the  conditions  of  (3.10)  must  be  verified.  Hien,  one  argues 

2 

from  (4.25)  and  (1.5)  that  (i-kp  .Y  =  o(kp. )  ,  so 

J  J 


g[  (i-kp  )/kp 
J  cJ 


kPd 


\  (i-kPj  )2/kPj  +  °'0-) 


But,  using  (4.22)  again. 


(i-kp.)2  =  — — —  [log  n-2  loglog  n  +  0(l)]  . 
J  cr  V2*h 


And  by  (4.10), 


fl  +  0( 


1 

log  n 


)]  • 


So 


g[  (i-kp .  )/kp  ]  kp  =  -  log  n- loglog n  +  0(1)  . 
J  J  J  ^ 


We  omit  the  other  details  in  the  proof  of  (4.?6). 

For  the  second  assertion  (4.27),  given  N.  =  i,  the  conditional 

distribution  of  N^t  is  binomial  with  success  probability 

P  =  p.t/(l-p.)  and  number  of  trials  k  =  k-i.  Some  preliminary 

estimates  are  needed  before  appealing  to  lemma  (3.10).  All  ”0" 

and  "o"  estimates  are  uniform  over  j  ^  j'  in  I  and  i,i' 

-aa 

satisfying  (4.25),  as  n  and  k/Vn  (log  n'p  tend  to  infinity. 
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We  will  show  that 


(U.2B) 


kp  =  kp  [l+o(l/log  n)] 

0 

(4.29)  (i'-£p)2/kp  =  log  n-2  log  log  n  +  0(l)  . 

Assume  these  bounds  for  the  moment.  Another  application  of  lemma  (5-10)  shows  that 

P(N  =i'|N  =i)  is  of  order 
O  J 

(kp)-1/2  exp[-i  (i'-kp)2/£p]  . 

Now  use  (4.28)  and  (4.29)  as  before  to  complete  the  proof. 

We  now  prove  (4.28)  and  (4.29).  Let 


6  =  kp  -  kpj , =  (kpj-i  )p  , / (1-Pj )  . 


Abbreviate  h  =  i'-kp.,  =  i-kp..  Then  (b)  of  Lemma  (4.10)  implies 

J  J 

6/h  =  -Pj,/(l-Pj)  =  0(l/y/n).  Condition  (4.25)  makes  h  of  order 

k1//£Ln  ly/2  (log  n)1//2,  and  then  0  =  0  (k^n-^  (log  n)1/2).  Finally 

kp.,  i«  of  order  k//n,  again  by  (b)  of  lemma  (4.10).  To  summarize: 
J 


(4.50)  e  =  Of k1//2n-3/4(log  n)1/2],  h  3 


,1/2  -1/4,.,  sl/2 
k  n  (log  n)  '  , 


kpj,  »  kn 


-1/2 


Now  we  can  prove  (4.28): 


A  A 

kp  =  kp^. ,  +  0 


=  kp  ,  +  0[k1/2n"^/1+  (log  n)1/2] 

J 

=  kp.,  {l+0[k-1/'  n-1/^(log  n)1//2)} 
J 


kPj,  [l+o(l/log  n)]  . 


To  prove  (4.29),  note  that  i*-kp  =  h-6.  Now 


(i'  -fcpr/kp  =  (h2-29h+e2)/kp  =  h2/£p  +  o(l)  . 

From  (4.25) 

h2  _  — - -  [log  n  -  2  log  log  n  +  0(l)]  . 

cr  ^ 2nn 


In  view  of  (4.10), 

kp  =  - -  [1  +  0 (1/log  n)]  . 

cr  v^2nn 


So 

h  /kp  =  log  n-2  log  logn  +  0(1)  .  Q 
We  can  now  prove  the  uniqueness  assertion  in  (1.8). 

(4.31)  Proposition.  Assume  (l.  1-1-6).  Then  max..  (N,-kp.. )  is  assumed 

J  J  J 

at  a  unique  index  j,  with  probability  approaching  one. 

Proof.  We  first  establish: 

(4.32)  Given  e  >  0,  there  are  large,  finite  numbers  a  and  b,  such 

that  eventually,  except  for  probability  e,  max.(N.-kp.)  is 

0  0  0 

assumed  only  for  indices  in  I  ,  and  is  between  z  (-b)  and  z 

-aa  n  n 

l/2 

"Eventually"  means  for  sufficiently  large  values  of  n  and  k/n  /  (log  n 
Indeed,  for  any  b,  except  for  probability  less  than  e/5,  the  maximum 
over  zone  IV  is  eventually  smaller  than  zn(-b)  by  (**••!)•  Likewise  for 
zone  III  by  (4.3), and  zone  II  by  (4.5).  a  matter  of  notation,  zone  I 
is  I  and  can  be  dealt  with  by  (4.9).  There  are  a  and  b  so  large 

®(X)  00 

that  eventually,  except  for  probability  e/5, 


(b). 


z^-b)  <  max{Nj-kpj  for  j  in  )  <  zn(b)  . 


Finally,  choose  a  so  large  that  eventually,  except  for  probability  e/5, 

N.-kp.  <  z  (-b)  for  all  j  e  I  \I  .  This  completes  the  proof  of  (4.32). 
j  n  -oo  oo  -aa 

The  next  two  estimates  are  easily  checked. 


(4.33)  The  number  of  pairs  (j,j*  )  with  j*  /  j  in  J  is  0(n/log  n). 


(4.3)i)  The  number  of  integers  i  with  zfi(-b)  <  i-kp^.  <zn(b)  at 


z  (b)-z  f-b)  =  0[kl/2n-l/U(log  n)"l/2J  . 


n  n 


Now,  lemma  (4.24)  and  relations  (4.33-34)  enable  us  to  estimate 

the  chance  of  finding  two  distinct  indices  j'  /  j  in  I  with 

”8.8. 

N.-kp  =  N.,-kp.,  in  the  critical  range  from  z  (-b)  to  z  (b).  Take 
J  j  J  J  n  n 

the  number  of  pairs  j  /  j'  in  I  ,  and  multiply  by  the  number  of 

■“  cL9# 

integers  i  with  i-kp.  in  the  critical  range.  Each  i  is  a  possible 

J 

value  for  N.,  and  associated  with  it  is  at  most  one  possible  value  i' 

0 

for  N.,  satisfying  i-kp.  =  i’-kp.,.  Then,  we  multiply  by 
J  J  J 

P(N.=i)  •  P(N . (  =i'  | N  =i ).  The  result  is  Otk^^n-1^ (log  n)1^2]  ->Q.  □ 

.1  J  *  j 

The  proof  of  theorem  (1.8)  i.an  now  be  given. 

Proof  of  theorem  (1.8).  As  proposition  (4.31)  implies,  is  assumed 

at  a  uni  cue  location  L^,  with  probability  approaching  one.  Lemmas  (4.1), 
(4.3),  and  (4.5)  imply  that  with  probability  approaching  one,  is  in 

zone  I.  Now,  proposition  (4.9)  implies  (1.8)  by  the  same  argument  which 
showed  that  equation  (2.4)  implies  proposition  (2.2).  □ 


r 


3.  Moment  Assumptions. 

We  have  been  assuming  ;1..')  that  EX^|  <  »  .  An  argument  sketched 
later  in  this  section  will  prove  theorem  1.8)  under  the  weaker  hypothesis 


(5.li 


E{ 1 X,  f  [log(l+  X 


(i  ) 


< 


for  some  5  >  0 


However,  if  only 


O  c 

(5.2)  E{ |X1i"[log(l+|X1| )]°}  <  .c  for  some  5  with  0  <  5  <  1  , 


the  conclusions  of  the  main  theorem  (1.8)  become  false.  This  is 
demonstrated  by  theorem  (5.4)  below,  which  gives  the  correct  asymptotic 
formulae  for  a  particular  random  variable  satisfying  (5-2)  but  not  (5 • 1 ) - 
The  present  discussion  leaves  unresolved  the  caseEflX^  [  log  (1+  [  |  )  ]  )  < 

for  some  8  with  0  <  6  <  1. 

Theorem  (5.4)  uses  an  Edgeworth -like  correction  in  the  local  central 

limit  theorem  for  a  random  variable  without  third  moments,  given  in  ( 5 -  ) • 

The  techniques  used  are  similar  to  those  in  Pitman  (1968).  Some  related 

work  can  be  found  in  Cramer  (1965),  or  Takeuchi  and  Akahira  (I976a,b ). 

We  now  define  a  class  of  probability  distributions  on  the  integers. 

Let  a  >  1  be  given.  Let  q  =  q  be  the  symmetric  probability  on  the 

a 

integers  with 


f  q(0)  =  q(-l)  =  q(l)  =  0 

(5.3)  \  q(j)  =  b/[j;(log| j| )a!  for  j  =  +2,  +3,... 

lb  is  chosen  so  that  q  ( j  )  =  1  . 

'  J 


'\6 


00 


> 


As  for  theorem  (1.8),  let  c  be  the  variance  of  q.  Let  X^,Xg> ...  be  independent 

with  common  distribution  (5 .3),  and  write  Sn  =  + • • •  + Xq.  Take  k  independent 

copies  of  S  .  Let  N.  be  the  number  of  these  sums  which  are  equal  to 
n  j 

j.  Let  =  max^[Nj-kP(Sn=j  )J. 

(5-4)  Theorem.  Let  X^,X2,...  be  independent,  having  common  distribution 
(5.3 ),  with  1.5  <  a  <  2.  Let  c  =  b2a  1/(a-l).  Define 

(5.5)  z  (x, a)  =  / - -  -/log  n  +  c  (log  n)2"a-2  log  log  n  +  x  . 

V  a  -/2rt n 

Then,  with  probability  approaching  one,  is  taken  on  at  a  unique  location 

L^,  and  the  probability  that 

L^  <  y a  /2n/log  n  and  <  zn(x,a) 

converges  to 

41  (y )  exp[  -c r  e_x^2  ] 

as  n  and  k  tend  to  infinity  satisfying  the  growth  condition  (1.5). 

Note  that  q  in  (5-3)  has  a  second  moment, but  just  barely.  In 
particular 

£  j"(log(l+l j|  )]8q(j  ) 


is  finite  provided  6  <  a-1,  but  infinite  for  8  >  a-1.  Note  too  that 
zR^x,a)  defined  in  (5.5)  is  different  from  zn(x )  of  (l.lo).  When 
a  -  2,  the  difference  is  not  so  dramatic — a  constant  under  the  square 
root  sign.  But  when  a  <  2,  there  is  an  extra  term  going  to  infinity, 
namely  c (log  n)2  a. 


To  prove  ((j.4),  sharp  estimates  of  PfS  -j)  are  needed.  This  is 
done  by  approximating  the  characteristic  function  of  X^.  We  begin  with 
some  preliminary  estimates.  The  first  lemma  is  implicit  in  Pitman  (1968). 


(5.6)  Lemma.  Suppose  the  real  numbers  u  are  nonnegative,  nonincreasing 
and  (weakly)  convex: 


u  > 
n  — 


>  u  , 
-  n  *1 


u  -u  >  u  -  - 
n  nQ  —  n+1 


n+2 


Suppose  too  that  u  '  as  n  -*  ■  .  Let  S  be  the  (conditionally  convergent! 
sum  UQ-u^+Up-u, +•  *  •  .  Then 


|  Uq  <  S  <  ^  -  £  ux 


Proof.  Clearly, 


S  =  )  +  (Up-U, )  +  •  •  • 

>  (u^u.  )  1  (Uj-Uj^)  +•••  by  convexity 


This  proves  the  first  inequality.  For  the  second. 


S 


u  -  u  -u  +u . 


1 


> 


because  u  -u  *u  -  •••  >  4  u  by  the  first  inequality  applied  to  the 
1  2  ~  1 

sequence  u^,u  , ...  .  Q 

The  next  lemma  is  at  the  heart,  of  the  approximations  in  this  section. 

Tt  is  abstracted  from  Theorem  of  Pitman  ( 1 ' ) .  We  work  with  conditionally 


convergent  Kiomann  integrals.  Such  integrals  are  denoted  by  using  an  arrow 
over  the  integral  si*.rn. 


(5.7)  Lemma.  Let  H(x)  be  a  convex  function  on  [0,oo)  which  decreases 


to  0.  Then,  for  any  t  >  0, 


(5.8)  |  IQ  <  t  H(x)  sin(tx)  dx  <  IQ  -  ~  I± 


where 


(5.9)  In  =  f  H(x/t)  sin  x  dx  and  I,  -  f  H[  (x+n)/t]  sin  x  dx  . 

u  J  0  '  0 

Proof.  By  changing  variables,  it  is  enough  to  do  the  case  t  =  1. 

Abbreviate  h.(x)  =  H(x+j«).  Split  at  multiples  of  n  to  see  that  the 
J 

integral  to  be  estimated  equals  the  conditionally  convergent  sum 


£°°  /  «(x)  sin  x  dx  =  £  I 

.1=0  •'.in  ,i=0  JO 


(-1/h  .  (x)  sin  x  dx 


Croup  the  terms  as  (u^-i^)  +  (Ug-u^)  +••*  to  get 


f  [h  (x)-h  (x)]  sin  x  dx  . 

“k=0  Jo  ‘ K  1 


Because  H  is  monotone  decreasing,  h  ^  >h^+^:  so  the  last  sum  is 
absolutely  convergent,  and  we  can  take  £  inside  the  integral  by  Fubini1  s 
theorem.  As  a  result, 

,  -v«)  n 

i  H(x)  sin  x  dx  :  T"  [h0.  (x  )-h  ,,  (x )]  sin  x  dx  . 

o  -o  k=0  ~k  kl 

Now  H  is  convex,  so  for  each  x  the  sequence  h^l.x  1 ,  h-^(x),  •••  is 


1+9 


convex.  Ely  ‘-.o'. 


4  h  vx >  <  T  '  ! h  x  '-h  ,  x  )  <!iilx)  -  tMx)  . 

O'  —  u ,  . k  .  k •  1  —  v  I 

K  \ 

'this  completes  the  proof.  Q 

We  will  now  apply  this  result  to  H's  of  order  i / vlog  x)  ,  at 
and  evaluate  the  corresponding  integrals  I,  and  in  (5.7  .  A 

calculus  estimate  will  be  needed. 


Lemma.  Let 

o  b< 

a  pos 

i  Live 

number  and 

x  a 

real  number 

(a^ 

1  tl*x 

^  1  - 

Ox 

for  x 

"•  -1  . 

(b  'i 

i.  a  x 

>°  <  l  - 

Ox  1 

o  -1  ^ 

Ho  *1  V 

for  x 

IV 

i 

vc ) 

i/a-x 

f  .  t  - 

kx 

for  0 

I/- 

X 

1  /. 

with 

k  finite 

but  do 

pending 

on  o 

• 

Proof.  Expand  1  (.Hx^  in  a  Taylor  scries  with  remainder.  G 


(5-lf '  Lemma.  Let 


be  a  positive  number. 


Then,  as  t  ->  0  ■ 


and 


sin  x  dx 


sin  x  dx 


Uog  -) 


-a  -i 


infinity. 


Then, 


are  both  equal  to 


Proof.  We  will  only  do  the  first  integral,  the  second  being  similar, 
write  the  first  integrand  as 

(log  i)  (l  +  1-°^  *)  sin  x  . 
logi 

Now  x  >  /t ,  so  (log  x )/ (log  i)  >  -  ,  and  (5.11a-b)  can  be  used  to 

estimate  the  middle  factor  in  the  display.  The  integrals  which  result 
are  evaluated  as  follows: 

r  * 

i  sin  x  dx  =  2  +  0 (t ) 

r i/t 

f  ^  i 

sin  x  | log  xjd  dx  <  co  for  j  =  1  or  2.  O 

io 

The  following  calculus  facts  are  needed  to  estimate  the  tails  of 
the  distribution  (5.3  ). 


(5-13)  Lemma.  Let  cc  >  0  and  6  >  1, 

(a)  Define 

,  00  q  _a 

f(a,P,u)  =  i  x  (log  x)  dx  . 
u 

Then  f(a,P,u)  is  bounded  belov;  by 


1 

P-1 


-P+l,.,  v-a 
u  (log  u) 


a  -6+1  ,,  s-a  -i 
- 5-u  (log  u; 

(P-l) 


Likewise,  f(a,P,u)  is  bounded  above  by 


1 

P-l 


-P+1,,  v-« 

u  (log  u) 
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Claim  (c).  This  is  clear.  Q 

Of  course,  the  estimates  in  (5.13a-b)  can  be  developed  into  asymptotic 
series. 

We  now  begin  to  investigate  the  probability  density  q  defined  by 
( 5 O ) •  The  following  repeated  integrals  of  the  tail  of  q  will  be 


useful. 


(5.15)  Lemma.  Define  q(j)  and  b  by  (5-5).  For  x  >  0,  set 


F(x)  =  £  q(j) 

|j|  >x 


G(x)  = 

F (u)  du 

Jx 

H(x)  - 

]  G(u)  du  . 
Jx 

Then,  as  x  -*  oo , 

(a)  F(x)  =  bx~2(log  x)”a  +  0[x-2(log  x)-a-1  ] 

(b)  G(x )  =  bx  1(log  x)~a  +  0[x_1(log  x)  a-1  ] 

(c)  H(x)  =  (log  +  0 [  (log  x)'a]  . 

Proof.  This  is  immediate  from  (5. 15).  t  ; 

Note.  Claims  (b)  and  (c)  follow  from  (a).  In  fact,  theorem  (5. 9) 
and  the  supporting  results  (5.17-20-25)  hold  provided  the  X* s  are 
symmetric,  interger-valued,  aperiodic  and  satisfy  (5.15a),  namely, 

P( | X |  >  x)  =  bx  2(log  x)  a  +  0[x  2(log  x)  a  1] 

as  x  •><».  It  is  the  precise  bound  on  the  remainder,  namely 

0[x  2  (log  x)”a"1],  which  enables  us  to  push  the  calculations  throu^i.  In 

2 

this  generality,  there  is  no  connection  between  a  and  b,  or  a  =  Var  X; 
the  latter  is  finite  by  our  condition  on  P(|x|  >  x). 
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Let  p(t)  be  the  characteristic  function  of  q.  Since  q  is 
symmetric , 

(5-16)  <’(t)  =  £  cos  (Jt  )q  (j  ). 

j 

2 

The  next  lemma  gives  an  approximation  for  e.  Recall  that  cr  is  the 

2  n  /  \ 

variance  of  q:  namely,  cr  =  ^  0  q(j). 

j 

(5-17)  Lemma.  Fix  a  >  1.  As  t  tends  to  0, 

0  (t )  -  1  -  |  o-2t"  +  r  (t ) 


where 


r(t)  =  ^  t2(log  -j-ij-)  ahl  +  0 [  t^  (log  ]^]-)"a'J 
and  b  is  defined  in  (5-3). 

Proof.  By  symmetry,  it  is  enough  to  do  the  case  where  t  -♦  0*.  Integrating 
by  parts  three  times  gives 

(5«l8)  (,(t)  -  1  -  ~  a  t‘‘  +  r(t) 


where 


r(t)  t;  f  sin(tx)  H(x)  dx  , 

J() 

and  H  is  defined  in  (5.15).  Clearly,  H  is  convex  and  H(0)  <00 

r) 

because  <j~  <  on,  Now  (3-7 )  shows  that 
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(5.i9)  t2  |  I0  <  r(t)  <  t2(I0-|  Ix) 

with  i  and  1^  defined  as  in  (5. 9)-  To  approximate  1^,  break  the  range  of 
integration  into  two  zones  [0,Vt]  and  [/t,w].  For  the  first  zone. 


0  < 


H(x/t)  sin  x  dx  < 


H<0)/' 


7*  ! 

x  dx  =  ^  H(0) t 


For  the  second  zone,  use  the  approximation  (5.15c)  for  H(x/t),  along 
with  (;>.12).  Combining  the  two  zones  we  haver 


*0  =  (1°s  h  a  1  +  °t  (^g  “) ~a]  . 

The  same  estimate  for  can  be  obtained  by  the  same  argument.  Substi¬ 

tuting  into  (5.19)  completes  the  proof.  □ 

We  will  use  lemma  (5.17)  in  the  following  form: 

(5.20)  Lemma.  Define  q  and  b  by  (5-3).  Let 
istic  function  of  q  defined  in  (5.16).  Let  cr2 
Then,  as  t  -^-0, 


e(t)  be  the  character - 
be  the  variance  of  q. 


exp  (?)  c2t2)0(t)  =  l+y(t)+p(t) 


where 


y  (t) 


b 

a-1 


t2  (log 


1  \-a+l 

ItT3 


and 
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p(t)  -  0[t  (log  -j-ij)  a]  . 


The  next  two  facts  are  well  known,  but  are  recorded  here  for  ease  of 
reference. 

(5.21)  Lemma.  Let  z.  and  z\  be  complex  numbers  of  absolute  value 

<3  3 

at  most  A.  Then 


11?  z.  -  n?  ... 
j^i  j  j=i  j 


-  "'Ll2;'  <  a"  En 

5=1  J  3 


(5.22)  Lemma.  Let  n  be  a  positive  integer,  and  z  a  complex  number. 
Then 

1  2  ,  ,2  n| z I 


,  _  Nn  _  1  .  1  d.  1  1  d.  r 

(l+z )  -  1  -  nz  |  <  r-  n  |  a|  e 


We  are  now  ready  to  give  an  approximation  for  the  probability  density 

of  Sn-  We  will  abbreviate  the  normal  density  with  mean  0  and  variance 

2 

a  by  cp  : 

cr 


1  1  y  /  2 

4*  (x )  =  -  exp(-  ^  x/a  )  . 

a  \/2n  2 

Also,  we  introduce  the  Hermite  polynomial: 

h^Cx)  -  (x/ct)2-1  . 

(5-23)  Proposition.  Fix  a  >  1.  Lft  X^,X^,...  be  independent  with  common 
distribution  (5-3  )•  Let  Sn  --  +  •  •  •  +  Xn»  Hien,  as  n  -»  00, 

/n  P(r»n  j)  =  qPCT(j.Vn)U-enhcT(dA/n)]  +  r Jn  , 
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(5.23) 


where 


cr2  =  Var  X  , 

e  =  ~~T" /  (log  </n)a  1  with  b  as  in  (3.3)  > 

il 


and 


d  =  min(a,2a-2)  >  a-1 


r^n  =  O(log  n)  d  uniformly  in  j  . 

Note.  Similar  techniques  would  give  more  terms  in  an  Edgeworth -like 
expansion. 


Proof.  Our  argument  is  adapted  from  section  16.2  of  Feller  (1971).  By 
the  Fourier  inversion  formula. 


(5-24) 


(  00  1  O  O 

qPo.(x)  =  ~  I  exp(-itx)  exp(-±-  at  )  dt 

J  — oo 


Differentiate  twice  with  respect  to  x: 


(5.25) 


i  2  12  2 

(x )<p^(x )  =-  -  ~  exp(-itx)  t  exp(--  a  t  )  dt 

ixi 


CT  or 


Likewise, 


P(S„=J ) 


j.  r 

2n  I 

J  -IT 


exp  (-it  j)  e(t)  dt  . 


Changing  variables, 


(5.26) 


i  f  "V'n  _  _  n 

v' n  p(sn=J)  =  _  exp(-itj/Vn)  P(t/Vn)  dt 

J-nJ  n 
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Combining  (5.24- '5-2o ),  we  have 


rJn  =  »^n  P(Sn^j)  -  «Pff(j//F)  +  env0.(j//n)ha.(j//n) 
=  Jx  +  J 


where 


■  n/n 


i  r 1 

p7  _  exp(-it j//n)  \|r  (t)  dt 
J-n/n  11 


with 


(5.27)  1rn(t)  o(t/Jn)n-  (lKnt1!)  exp(-  ^  <x2t  ) 


and 


I 

'2  "  2«  J, 


1 1 1 >n/n 


exp(-itj//n)  (1+6  t2)  exp(-  ^ 


Clearly, 


'2 1  -  2rc  J.  ,  r  (1+tnt‘1)  exp(-  I  dt  * 

;|t  |  >"/n 


so  Jp  is  negligible. 
Likewise, 


UJ  <oT 


)^n(t)|dt  . 


(5.28) 


We  split  the  interval  of  integration  in  (5-28)  into  three  sub-intervals: 
1^,  the  set  of  t  with  ] 1 1  <  n1^ 

R^,  the  set  of  t  with  <  |t|  <  8n^ 

*1  /o  i  /p 

R^,  the  set  of  t  with  6n  <■  1 1 1  <  ffn  • 

We  will  choose  8  >  0  later. 

Consider  R^.  The  distribution  q(j)  of  (5.3)  is  aperiodic,  so 
SUPjt|>8  ^  ^  ^  =  \  <  1.  Of  course 

kn(t)|  <  0  (t/^/n)n  +  (l+ent2)  exp(-  i  a2t2)  . 

So  the  contribution  from  to  the  integral  in  (5-28)  is  at  most 

\n  ■/n  +  (1+e  t2)  exp  (-  ^cr2t2)  dt  . 

-'it!  >&/H  n  2 

This  is  negligible. 

Next  consider  R,,.  Using  (5. 17),  we  may  choose  &  >  0  so  small 

that 


|S(u)|  <1  -  ^  cr2u2  <  exp(-  j|-  a2u2) 
for  |u|  <8.  This  fixes  the  &  defining  R^.  In  particular, 

le(t/yn)|n  <  exp(-  ^  a2t2) 

for  all  t  in  R  .  As  a  result,  the  contribution  from  R^  to  the 
integral  in  (5.28)  is  negligible. 
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We  now  show  that  the  contribution  from  R^  to  the  integral  in 
S.28)  is  0 (log  n)  a  +  0 (log  n  '  a.  Let 


f'  (t)  exp(^  it  t'  /n)  6{t/fn)  -  1  . 

IX  r_ 

Recall  the  definition  of  from  (5.27).  Clearly 


(5*29)  tn(t)  =  {[l+8n(t']n  -  1  -  e^t'  }  exp(-  ~  cr  t2)  . 

Recall  y  and  p  from  (5. 20).  As  that  result  shows, 


(5-30)  Pn(t)  =  7  +  P (t//n)  . 

Use  (5.20)  again:  there  are  positive,  finite  constants  such  that 

for  all  t  in  R1 

(5.31)  |  np (t//n)  |  <  K^/Clog  n)a 
and  then 

(5.32)  |nPn(t)|  <K,tV(log  n)a_1  ; 
so 

(5*33)  !nPn(t)j  <  o2t^  for  all  large  n  . 


We  are  estimating 


Using  (5-29),  (5.  30),  and  the  triangle  inequality,  we  have 


•),  <  ,1 

1  —  a 


J 

c 
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where 


Ja  2n 

f  |np(t//n)|  exp(-  i-rr^t2)  dt 

*'RL 

J  --i 
Jb  "  2it 

j  |[  1+pn(t)]n  -  [  l'n&n(t )]  j  exp(-  n2t^  )  dt 

Jri 

J  -  -i 

Jc  2n 

|  ,nr(t/Vn)  -  ent2|  exp(-  -  cr' t2 )  dt  . 

Jr  O 

R1 

We  estimate  J  , 
a 

using  (5-31): 

Ja^27 

(log  n)-a  J'  t“  exp  (-  |  cr2t2)  dt 

=  0  (log  n )”  . 

We  estimate  J,  ,  using  (5.22): 

D 

l[l+f  (t)]n  -  1  -  nB  (t)i  <  -t  I nP  (t )  |  '  exp(  |nB  (t)|  ) 
n  n  —  c.  n  n 

In  view  of  (5.32-33  ),  this  bound  is  at  most 

T  (log  n)  ‘^a  t*  exp (j^-  cr"t  )  . 

So 

Jb  1  dr  *2  (log  a  f  t>J  exp(-  r  cr2t2)  dt 
=  Oflog  n)2_2a  . 


(>1 


Finally,  we  estimate  .1  .  This  is  mor.  delicate,  and  it  is 

c 

.'onvenient.  to  treat  two  .-.ones  separately.  Let 


R,  b.  the  set  of  t  with  1  <  it  I  <  n 


1/  • 


So 


R,  be  the  set  of  1  with  |t|  <1  . 


«l  =R1U  Rj  .  Using  the  definition  of  >  in  (b.20),  and  the 


definition  of  e  in  (5.;V  ),  the  contribution  of  J  from  r(*  is 
n  o  1 


_L  Jl_ 

a-l 


,  \  a-l 

V  log  V  n  1 


times 


(5*^5)  f  i  1(1-  lc  ®  )  -  1  :  t"  exp(-  \  cr‘  t‘‘ )  dt  . 

log  v  n 

The  integrand  in  (b.  S)  can  be  estimated  by  (b.llc),  because  j  t  j  <  n1 
in  so  log 1 1!  /.log  yn  <  4  in  R^ .  Hence  ( b  • ■'  1  •> )  is  at  most 

- —  J  logjtl  )  t'  exp '  -  -  cr"t'  )  dt  . 

log  y  n  |t |  >  1 


This  is  '  (log  n)  \  Thus,  the  contribution  to  J  from  R*  is  at 

c  1 

most  0  (log  n)  a.  Likewise,  the  Contribution  to  J  from  R,~  is 

c  1 

the  factor  ,5oM  times 


log(l,  1 1 1  )  ,1-a 
log  >  n 


l|  t  exp(-  -  <j  t'  )  dt 


By  lb.  lib),  this  is  at  most 


a-1 

log  y'n 


i: 


(log  piy)  t‘  exp(--  cr‘  t‘  )  dt 


.a 


a  (a- 


(log  ,  Jl) 


-  f 

nV  J-l 


Oog  yq-)"  t 


-  12  2. 

txp(-  —  cr  t 


dt 


Tliis  too  is  Oilog  Thus,  the  contribution  to  from  is  also 

0 (log  n)  a.  D 

We  are  now  in  position  to  finish  the  proof  of  (5.4).  The  argument  is 
very  similar  to  the  pro  f  of  theorem  (1.8)  and  we  just  indicate  the  changes. 


Proof  of  (5.4 ).  Tin  bounds  on  the  maximum  deviation  in  zones  IV,  III,  and 
II,  as  given  in  (4.1),  (4.A),  and  1.U.5)  go  through  without  change.  The 
estimates  used  in  are  valid  as  stated,  except  that  (4.10b)  becomes: 


(5oo^  p.  P .  S  -  i )  — - -  n  .  ,  where 

'  ,1  n  1  . -  ,i  n 

O’  ^  .  'nil 


uniformly  in 
follows  from 
and 


tin  '  1  -  >'  ■'  •• 

j  <■  zone  I;  the  M'(6'  )" 
(%."■'  ).  Indeed,  since  j 


(2o  n)  *  (log  /n)1_a 

a.**  1. 

is  uniform  in  j  and 
is  confined  to  zone  I, 


,  ,2-2a. 

+  0 ,  (log  n )  ) 


n.  This 

j'  fn  =  0(6-  ), 


exp[  -X /(2c  n)  I  =  I  -  [1*  0(6“"))  j‘/(;lT‘  n) 


We  must  now  multiply  by 


'log  >  n ) 


1-a 


•V  n 


b 

a-1 


We  get 


1 


0(&  )\i'  2cr  n  -  —■  (logVn)a  1  '  T 


where 


T  -  aogln)J"a  1  +  -  1)1 

2iT^n  Per*'  n 


.  a  n 

because  a  >  Ij  and  T  can  be  merged  into  the  lead  term  of  • 

Since  a  <  2, 

d  -  min(a,  2a.-.?)  =  2a-  , 

2-  'a 

accounting  for  the  error  term  O(log  n)‘  in  r.  .  Instead  of 

(4.1  we  must  estimate 

1  ’•  .  -  1  i  fl  •  0(p>  ■'  j“/(2cr' n) 

a) 

b  -^1-a  .  {2-2  a 

"  a^l  (log,»  n'  ‘-(log  n)  , 

the  higher  powers  of  the  lead  term  merging  into  the  0 (6^ ) ,  because  for 
,i  e  zone  I, 

>  •  >  >  a  ■> 

U'  A'cr  nV  (b  '  •  jV-'  o'  n  . 


I 

l 


6 


! 

c  (log  n)2_a  .  1  , 
a-1  . 

because  c  was  chosen  to  be  2  b/(a-l).  Furthermore, 

:  U  •  ..)(log  n)2_2a  =  o(l)  , 

because  we  imposed  the  condition  a  >  1.5>  so  3-2a  <  0. 

Hie  remaining  arguments  used  in  (1.8)  go  through  with  only  minor  changes.  Q 
Finally,  we  indicate  why  theorem  (1.8)  continues  to  hold  when  the 
third-moment  condition  (l . :? )  is  replaced  by  the  log-moment  condition 
(5.1).  Let 

F(x)  --  P(X1  >  x)  for  x  >  0 

* 

!  -  P(X1  <  x)  Tor  x  <  0  . 

By  Chebychev* s  inequality, 

(5. .57)  F(x )  Olx^  (log|x|  )"c  "5j  as  x  -*  «  . 


Let 


(5.38) 


G(x)  =  0  [|x|_1(log|x|  )“2-6]  as  x  -*■  +  «>  . 


Let 


H(x) 


G(u)  du 

G(u)  du 


for  x  >  0 


for  x  <  0  . 


Using  (5.13)  again, 

(5.39)  H (x )  =  0[  (log|x|  )  1_8]  as  x  -*  +  »  . 


Now,  if  0 (t )  is  the  characteristic  function  of  and  u  =  E 

we  can  argue  as  in  (5.17)  to  show  that 


0 (t )  =  1  +  iMt  -  |  E(X^)t2+  r(t) 


where  as  t  -*  0, 


r(t)  ^  0[  1 t  j2 (log  • 

Consequently,  by  an  argument  like  the  one  used  for  (5.23)  but 
without  the  Hermite  polynomial, 

(5.U0 )  /n  P(Sn=j)  =  qp^U-nji)  +  0(log  n)-1"6  , 
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where  t?~  ■  Var  X^.  The  approximation  can  be  used  in  place  of  the 

local  central  limit  theorem  (  .^7)  in  proving  (1.8).  An  approximation 
similar  to  (%'>0)  based  on  fractional  moments  (rather  then  log  moments) 
is  given  by  Ibragimov  (Ki66). 
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